Speaker adaptation of continuous density HMMs using multivariate linear regression
نویسندگان
چکیده
1 2 1 1 n j j j n H À @ A AE @ A j j j j j j j j j j j j j dependent @sA models nd dpts the model prmE eters to the new speker y trnsforming the men prmeters of the models with set of liner trnsE formsF he trnsformtions re found using mxE imum likelihood riteri whih is implemented in similr fshion to the stndrd wv trining lgoE rithms for rwwsF fy using the sme trnsformE tion ross numer of distriutions nd pooling the trnsformtion trining dt mximum use is mde of the dpttion dtF his llows the prmeters of ll stte distriutions to e dptedF esults re presented on the IHHH word ee esoure wnE gement wI dtse using ontinuous density qussin mixture rww system with rossEword triE phone modelsF ih stte in ontinuous density qussin mixture rww hs n output distriution mde up of numE er of mixture omponent densitiesF e stte with mixture omponents n e expnded to prlE lel single mixture omponent sttesF hus the se of single mixture omponent sttes is desriedD nd the extension to multiple mixture omponents is strightE forwrdF he proility density of stte generting speeh oservtion vetor of dimension is @ A a I @P A AE @IA where nd AE re the men nd ovrine reE spetively of the output distriution of stte F he dpttion proedure is sed on reEestimting the mens of the stte distriutions using liner trnsE form of the existing menF hus it is ssumed tht in dpting from the s system to the speker dpted @eA system the stte trnsition proilities nd the ovrines of the stte distriutions do not hngeF he s mens re mpped to the unknown h mens @ A y liner regression trnsform estimted from the dpttion dtX a where is the @ C IA trnsformtion mtrix nd is the extended men vetorX a I sf n individul regression mtrix is used for eh stte using smll mounts of dpttion dt will reE sult in very poor estimtes of the mtriesF husD eh regression mtrix is ssoited with mny stte distriutions nd estimted from the omined dtF ying trnsform mtries in this mnner is similr in essene to the tying of sttes or mixtures U V whih m m j n …
منابع مشابه
Speaker Adaptation for Continuous Density HMMs: A Review
This paper reviews some popular speaker adaptation schemes that can be applied to continuous density hidden Markov models. These fall into three families based on MAP adaptation; linear transforms of model parameters such as maximum likelihood linear regression; and speaker clustering/speaker space methods such as eigenvoices. The strengths and weaknesses of each adaptation family are discussed...
متن کاملMaximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
A method of speaker adaptation for continuous density hidden Markov models (HMMs) is presented. An initial speaker-independent system is adapted to improve the modelling of a new speaker by updating the HMM parameters. Statistics are gathered from the available adaptation data and used to calculate a linear regressionbased transformation for the mean vectors. The transformation matrices are cal...
متن کاملSpeaker normalization and adaptation based on linear transformation
We propose novel speaker independent (SI) modeling and speaker adaptation based on a linear transformation. An SI model and speaker dependent (SD) models are usually generated using the same preprocessing of acoustic data. This straightforward preprocessing causes a serious problem. Probability distributions of the SI models become broad and the SI models do not give good initial estimates for ...
متن کاملDiscriminative speaker adaptation with conditional maximum likelihood linear regression
We present a simplified derivation of the extended Baum-Welch procedure, which shows that it can be used for Maximum Mutual Information (MMI) of a large class of continuous emission density hidden Markov models (HMMs). We use the extended Baum-Welch procedure for discriminative estimation of MLLR-type speaker adaptation transformations. The resulting adaptation procedure, termed Conditional Max...
متن کاملRobust speaker adaptation of continuous density HMMS using multilayer perceptron network
The performance of global affine and nonlinear transformations for speaker adaptation in a hidden Markov model (HMM) speech recognition system are compared in this paper. The nonlinear transformation was obtained with a multilayer perceptron network (MLP) which was trained during the adaptation process to transform the mean vectors of the HMMs such that the output probabilities of the HMMs for ...
متن کاملSpeaker adaptation using tree structured shared-state HMMs
This paper proposes a novel speaker adaptation method that exibly controls state-sharing of HMMs according to the amount of adaptation data. In our scheme, acoustic modeling is combined with adaptation to e ciently utilize the acoustic models sharing characteristics for adaptation. The shared-state set of HMMs is determined by using tree-structured shared-state HMMs created from the history rec...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1994